NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs

https://doi.org/10.1145/3577193.3593725

Chen, Jou-An; Sung, Hsin-Hsuan; Shen, Xipeng; Choudhury, Sutanay; Li, Ang (June 2023, ACM)

Full Text Available
Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design

https://doi.org/10.18653/v1/2023.findings-emnlp.560

Sprueill, Henry; Edwards, Carl; Olarte, Mariefel; Sanyal, Udishnu; Ji, Heng; Choudhury, Sutanay (January 2023, Association for Computational Linguistics)

Full Text Available
Cloud Services Enable Efficient AI-Guided Simulation Workflows across Heterogeneous Resources

https://doi.org/10.1109/IPDPSW59300.2023.00018

Ward, Logan; Pauloski, J. Gregory; Hayot-Sasson, Valerie; Chard, Ryan; Babuji, Yadu; Sivaraman, Ganesh; Choudhury, Sutanay; Chard, Kyle; Thakur, Rajeev; Foster, Ian (May 2023, IEEE)
Attention-based aspect reasoning for knowledge base question answering on clinical notes

https://doi.org/10.1145/3535508.3545518

Wang, Ping; Shi, Tian; Agarwal, Khushbu; Choudhury, Sutanay; Reddy, Chandan K. (August 2022, BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics)

Full Text Available
GEDet: detecting erroneous nodes with a few examples

https://doi.org/10.14778/3476311.3476367

Guan, Sheng; Ma, Hanchao; Choudhury, Sutanay; Wu, Yinghui (July 2021, Proceedings of the VLDB Endowment)

Full Text Available
Preparing for the next pandemic via transfer learning from existing diseases with hierarchical multi-modal BERT: a study on COVID-19 outcome prediction

https://doi.org/10.1038/s41598-022-13072-w

Agarwal, Khushbu; Choudhury, Sutanay; Tipirneni, Sindhu; Mukherjee, Pritam; Ham, Colby; Tamang, Suzanne; Baker, Matthew; Tang, Siyi; Kocaman, Veysel; Gevaert, Olivier; et al (June 2022, Scientific Reports)

Abstract Developing prediction models for emerging infectious diseases from relatively small numbers of cases is a critical need for improving pandemic preparedness. Using COVID-19 as an exemplar, we propose a transfer learning methodology for developing predictive models from multi-modal electronic healthcare records by leveraging information from more prevalent diseases with shared clinical characteristics. Our novel hierarchical, multi-modal model ($${\textsc {TransMED}}$$ $T R A N S MED$ ) integrates baseline risk factors from the natural language processing of clinical notes at admission, time-series measurements of biomarkers obtained from laboratory tests, and discrete diagnostic, procedure and drug codes. We demonstrate the alignment of$${\textsc {TransMED}}$$ $T R A N S MED$ ’s predictions with well-established clinical knowledge about COVID-19 through univariate and multivariate risk factor driven sub-cohort analysis.$${\textsc {TransMED}}$$ $T R A N S MED$ ’s superior performance over state-of-the-art methods shows that leveraging patient data across modalities and transferring prior knowledge from similar disorders is critical for accurate prediction of patient outcomes, and this approach may serve as an important tool in the early response to future pandemics.
more » « less
Self-Supervised Learning of Contextual Embeddings for Link Prediction in Heterogeneous Networks

https://doi.org/10.1145/3442381.3450060

Wang, Ping; Agarwal, Khushbu; Ham, Colby; Choudhury, Sutanay; Reddy, Chandan K. (April 2021, Proceedings of The Web Conference (WWW))
null (Ed.)
Full Text Available
Application-Specific Graph Sampling for Frequent Subgraph Mining and Community Detection

Purohit, Sumit; Holder, Lawrence; Choudhury, Sutanay (December 2017, IEEE International Conference on Big Data)

Graph mining is an important data analysis methodology, but struggles as the input graph size increases. The scalability and usability challenges posed by such large graphs make it imperative to sample the input graph and reduce its size. The critical challenge in sampling is to identify the appropriate algorithm to insure the resulting analysis does not suffer heavily from the data reduction. Predicting the expected performance degradation for a given graph and sampling algorithm is also useful. In this paper, we present different sampling approaches for graph mining applications such as Frequent Subgrpah Mining (FSM), and Community Detection (CD). We explore graph metrics such as PageRank, Triangles, and Diversity to sample a graph and conclude that for heterogeneous graphs Triangles and Diversity perform better than degree based metrics. We also present two new sampling variations for targeted graph mining applications. We present empirical results to show that knowledge of the target application, along with input graph properties can be used to select the best sampling algorithm. We also conclude that performance degradation is an abrupt, rather than gradual phenomena, as the sample size decreases. We present the empirical results to show that the performance degradation follows a logistic function.
more » « less
Full Text Available

Search for: All records